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IN THE CLAIMS: 

1. (Currently amended) A method for tracking multiple objects in a video sequence 
comprising: 

selecting an initial configuration comprising a plurality of objects, where the initial 
configuration represents an initial relationship among the objects in said plurality of 
objects: 

predicting a current nnnftguratio n. where the current con figuration represents a 
current relationship among the objects in said oluralitv of objects; and 
computing a likelihood for the cun^ent configuration, 

2. (Original) The method of claim 1 wherein said predicting step comprises performing 
an object level prediction. 

3. (Original) The method of claim 1 wherein said predicting step comprises perfomiing 
a configuration level prediction. 

4 (Original) The method of claim 3 wherein said configuration level prediction handles 
object addition and deletion from a current configuration, 

5. (Original) The method of claim 1 wherein the predicting step comprises: 

determining a percentage of the objects that are covered by the current 
configuration: 

determining a number of current configurations that correspond to the objects; 

and 

maximizing said percentage and minimizing said number to identify an optimal 
current configuration. 

6. (Previously presented) The method of claim 5 wherein said percentage detemiining 
step comprises solving: 
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7, (Previously presented) The method of claim 5 wherein said percentage determining 
step comprises solving: 




(\jBLa) 
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8. (Original) The method of daim 1 wherein multiple objects in a video sequence are 
represented by said configuration comprising a plurality of modeled objects. 

9. (Original) A method of producing probability distributions of states for multiple objects 
in a video sequence comprising: 

performing hierarchical sampling of at least one frame of video in said video 
sequence, wherein said sampling is performed in an object configuration and individual 
object state s, said object configuration representing a relationship among a p luraiitv of 
objects within a scene: and 

repeating said sampling for each frame of video in said video sequence to track 
objects within the video sequence. 

10. (Canceled) 

11. (Currently amended) A computer readable medium containing a program that, 
when executed by a processor, causes an image processing system to perform a 
method comprising: 

selecting an initial configuration comprising a plurality of objects, where the initial 
configuration represents an Initial relationship am ong the objects In said plurality of 
objects: 
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predicting a current nnnfirjnration . where the c urrent configuration represents a 
current relationship among the oblecta In said plu rality of objects; and 
computing a likelihood for the current configuration. 

12. (Original) The method of claim 11 wherein said predicting step comprises 
performing an object level prediction. 

13. (Original) The method of claim 11 wherein said predicting step comprises 
performing a configuration level prediction. 

14. (Original) The method of claim 13 wherein said configuration level prediction 
handles object addition and deletion from a current configuration. 

15. (Original) The method of claim 1 1 wherein the predicting step comprises: 

detemiining a percentage of the objects that are covered by the current 
configuration; 

determining a number of current configurations that con-espond to the objects; 
maximizing said percentage and minimizing said number to identify an optimal 
current configuration. 

16. (Previously presented) The method of claim 15 wherein said percentage 
determining step comprises solving: 



17. (Previously presented) The method of claim 15 wherein said percentage 
determining step comprises solving: 
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18. (Original) The method of claim 11 wherein multiple objects in a video sequence are 
represented by said configuration comprising a plurality of modeled objects. 
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